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USER SELECTED BACKGROUND NOISE REMOVAL FOR SCANNED 

DOCUMENT IMAGES 



FIELD OF THE INVENTION 
[0001] The present invention relates to a system and method of image processing, and 
in particular this disclosure provides a system and method for removing unwanted 
background image data from scanned document images by user selection. 



BACKGROUND OF THE INVENTION 

10 [0002] Scanners and copiers are well-known office machines that provide valuable 
M office functions both in the workplace and at home. One important component of 

g these scanners and copiers is the image processing that automatically filters noise 

09 (unwanted information) from the scanned document. 

N 

m 

gj [0003] Two-sided documents, i.e., documents having text and /or pictorial content on 

f 15 both sides of the paper, present challenges for producing quality copies of the original 
Cj documents. When two-sided documents are scanned in a copy machine or a scanner 

fy visual noise may appear in the copies that was not present on the scanned surfaces of 

W the original documents. The visual noise may be the result of digitally captured text 

r|| and/or pictorial content printed on the opposite side of a scanned surface commonly 

20 referred to as bleed-through. Bleed-through is more prevalent for copies of documents 
having a white or very light color background. In addition, the thickness of the 
scanned documents may increase the intensity of the bleed through effect, since 
thinner paper is more transparent than thicker paper. Moreover, when scanning a 
paper document, the paper is generally not captured as exactly white as a result of 
25 inaccurate calibration of the scanner with respect to each paper type. The combination 
of the non-white background and bleed-through both reduce visual quality of the 
scanned documents, especially when printing them, since slightly off-white color is 
typically rendered as scattered dots by printers. 
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[0004] Certain known background noise removal techniques consist of estimating the 
background color of the scanned-in document by analysis of local or global tone- 
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statistics of the document image, and then applying a local or global tone-mapping 
which maps the estimated background color or lighter tones to pure white. 

[0005] Fig. 1 A illustrates the steps of one type of prior art scanning or copier system 
including scanning 10, image processing 11, rendering 12, and displaying or printing 
5 13 a document image. Frequently, this type of system uses a type of processing 
technique referred to as "single pass" since typically all operations are performed 
within the system (e.g., copier) in a single irreversible processing pass. A system 
designed using the "single pass" technique generally only allows for a small amount 
of buffered image data for processing. The entire image is not stored and all 
10 processing is performed within the system such that there is no need for image 

compression, transmission, decompression, or off-line computing. Processing is often 
performed cumulatively and then processed data is immediately printed. As shown in 
Fig. 1 A, background noise removal is performed during image processing. To date, 
this background noise removal technique has been irreversible in the "single pass" 
fi) 15 type system. In other words, once the background noise removal technique has been 
, performed the original image data of the document is no longer available due to the 

&j single pass processing technique which can potentially cause a loss of visually 

flj important information. In some cases background noise removal may be undesirable 

S3 

g if the type of document image does not lend itself to typical background noise removal 

Hi 20 processes and removal results are typically poor. However, according to this system, 
if the background noise removal technique does not perform satisfactory, the user does 
not have the option of regaining or using the visual information lost during the 
background removal process. 

[0006] Fig. IB shows the steps of another type of known scanning or copier system 
25 which can be formed by using an add-on background removal function with a pre- 
existing scan system. According to this type of system, image data is stored and 
available for performing background removal operations, however, the background 
removal operations are performed inefficiently within the system. Specifically, a 
document image is scanned 14 and processed 15 to enhance the image. After 
30 scanning and processing, the processed image data is stored 16. Background noise 
removal 17 is performed by accessing the stored image and performing all of the 
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necessary statistical operations to derive a background noise removal function. These 
operations often require highly computational and complex statistical image data 
analysis. Hence, each time background noise removal is performed in the system 
shown in Fig. IB, these highly computational steps are repeated. Often, the analysis 
5 performed during background noise removal is similar to or the same as the analysis 
performed during image processing step 15, however, according to this type of prior 
art processing technique these highly computational analysis and processing steps are 
repeated since all of the background removal is performed after the image has been 
stored and the prior analysis results are not made available for use during background 
10 noise removal. Consequently, this type of system performs background noise removal 
in an inefficient manner. 

M 

n [0007] Hence a need exists for an efficient system and method of background noise 

jjjg removal of a scanned document that also provides a user with the ability to reverse 

Sj and/or control the background noise removal process after scanning the document. 
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SUMMARY OF THE INVENTION 

[0008] A system and method of removing background noise from a digital image of a 
3 scanned document is described. The system and method is a reversible background 

noise removal technique that allows a user to select whether background noise is 
20 removed or not. In addition, since the present invention divides the background noise 
removal operations into a two phase process, the overall efficiency of the system and 
method are significantly improved. 

[0009] According to a first embodiment of the system and method, during a first phase 
(i.e., prior to image storage) statistical data is obtained and a background noise 

25 removal tonemap function is derived and during a second phase (i.e., after image 

storage) background noise removal is performed using the tonemap derived in the first 
phase. In particular, during the first stage, statistical data is obtained from the scanned 
document image data and the background noise removal tonemap function for the 
entire image is derived from the statistical data. The image data and the tonemap 

30 function are then stored. During the second stage and upon user selection, background 
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noise is removed by retrieving the image data and tonemap function and using the 
tonemap function to convert pixel values in the digital image. Alternatively, during 
the second stage the user may also select to bypass background noise removal prior to 
image rendering. In one embodiment, the statistical data corresponds to an estimated 
5 global background tone value. 

[0010] According to this embodiment, the scanned document image data can be pre- 
processed, for instance enhanced, while determining the statistical data prior to storing 
the image data and the tonemap function or the statistical data. In another 
embodiment, the processed image data and a look-up table (LUT) corresponding to 
10 the tonemap are stored. In still another embodiment, the LUT and the processed 
image are stored according to a predetermined data format 

g| [0011] According to a second embodiment of the system and method, during the first 

i P hase ( i e > P rior to image storage) statistical data is obtained and during the second 

S stage (i.e., after image storage) the background noise removal tonemap function is 

15 derived (using the statistical data obtained in the first phase) while performing 
background noise removal and upon user selection. In particular, during the first 
J3 phase, statistical data is obtained from the document image data. The image data and 

r| the statistical data are then stored. During the second stage and upon user selection, 

|j background noise is removed by retrieving the image data and statistical data and a 

fy 20 background noise removal tonemap function is derived. The tonemap function is used 
to convert pixel values in the digital image. Alternatively, during the second stage the 
user may also select to bypass background noise removal in which case, the tonemap 
function is not derived. 

[0012] According to this embodiment, the scanned document image data can be 
25 processed, for instance enhanced, while determining the statistical data prior to storing 
the image data and the statistical data. The statistical data can include but is not 
limited to histograms of the image data and an estimated global background tone 
value. 

[0013] Other aspects and advantages of the present invention will become apparent 
30 from the following detailed description, taken in conjunction with the accompanying 
drawings, illustrated by way of example of the principles of the invention. 
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BRIEF DESCRIPTI ON OF THF DRAWINGS 

[0014] Fig. 1A illustrates a first prior art processing system including irreversible 
background noise removal; 

[0015] Fig. IB illustrates a second prior art processing system which performs all 
statistical background analysis and background noise removal after image storage; 

[0016] Fig. 2A illustrates a functional block diagram of a first phase of a first 
embodiment of background noise removal according to the system and method of the 
present invention; 

[0017] Fig. 2B illustrates a functional block diagram of a second phase of the first 
embodiment of background noise removal according to the system and method of the 
present invention; 

[0018] Fig. 3 illustrates a first embodiment of the method of background noise 
removal of a scanned document digital image; 

[0019] Fig. 4A illustrates a functional block diagram of a first phase of a second 
embodiment of background noise removal according to the system and method of the 
present invention; 

[0020] Fig. 4B illustrates a functional block diagram of a second phase of the second 
embodiment of background noise removal according to the system and method of the 
present invention; 

[0021] Fig. 5 illustrates a second embodiment of the method of background noise 
removal of a scanned document digital image; 

[0022] Fig. 6A shows an example of an edge luminance histogram used for estimating 
a global background tone value when performing statistical analysis according to one 
embodiment of the present invention; 

[0023] Fig. 6B shows an example of luminance histogram corresponding to all image 
pixels used for estimating a global background tone value when performing statistical 
analysis according to one embodiment of the present invention; 
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[0024] Fig. 7 shows a limiting function used when performing statistical analysis 
according to one embodiment of the present invention; and 

[0025] Fig. 8 shows an example of a tonemap function in which Y b =220. 
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DETAILED DESCRIPTION OF THE INVENTION 

[0026] A system and method of background noise removal from scanned document 
image data, is described. In general, the present invention provides a system and 
5 method for efficiently performing user selected background noise removal on a 
scanned document image. 

[0027] For purposes of this description, document image data is. generally defined as 
image data including textual, graphical, and natural image data. The background of a 
scanned document image, is generally defined as the predominant area other than the 
M 10 text, graphics, and natural images included within the image. Background noise is 
p generally defined as any image data within the background area of the image that was 



vj not in the original image of the document or other unintended undesirable image data 

II! within the background area. 



SI 
If! 

m 

g ? [0028] The present invention is based on a document scanning technique in which the 

|j 15 entire document image data is stored at some point in the process such as shown in 
fCf Fi S- 1B - However, unlike the system shown in Fig. IB, the system and method of the 

gj present invention can be viewed as splitting the background noise removal operation 

Til into two phases. The first phase is generally defined to occur while the document is 

scanned and prior to when the entire document image data is initially stored. The 
20 second phase is generally defined to occur after the image data is loaded from file 
storage and prior to image display or printing. According to the present invention, 
certain processes of the background noise removal operation are performed within 
each Of the "first and second phases in order to enhance overall system performance 
and efficiency and user experience. It should be understood that when loading image 
25 data from file storage the image data is transferred from a storage area such as disk 

storage into main memory of a computing unit or other storage area that is adapted for 
ease of accessibility of the data while image processing. It should be further 
understood that prior to file storage the image data is typically compressed and upon 
loading from file storage the data is decompressed. 
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[0029] Figs. 2A and 2B show functional flowcharts of first and second phases, 
respectively, of a first embodiment of the present invention. Referring to Fig. 2A, 
initially, statistical analysis 21 is performed on the scanned document image data. 
Statistical analysis can be achieved by performing a variety of statistical operations 
5 and calculations and can generate a corresponding variety of statistical results. For 
instance, some statistical operations can include estimation of tone probability 
distribution conditional on aspects of the image data such a&edge location and size of 
connected components. Statistical operations can also include any estimation of tone 
probability using other statistical methods. Some statistical results can include the 
10 main modes of the tone probability distribution (e.g., the brightest mode corresponds 
to the global background tone value). A preferred statistical analysis is described 

U herein below. According to one embodiment, statistical analysis is performed wherein 

P an estimated global background tone value is determined. 

8 

S [0030] From the statistical data obtained from the statistical analysis, a background 

gj 15 noise removal tonemap function is determined. A tonemap function is determined ? 



2 

ST' 



that can identify pixel values corresponding to the background using the statistical 
data obtained from module 21 and maps the identified pixel value to a new value 

fl] 

corresponding to a selected background color (e.g., white) by setting all of its channels 
pj to a value of a value corresponding to that color. In the case in which the statistical 

ffj 20 analysis is performed to obtain an estimated global background tone value, then the 
tonemap function is determined based on that value. The tonemap function is then 
saved as shown by data storage module 24. In an alternative embodiment (not shown), 
a look-up table corresponding to the background noise removal tonemap function is 
generated and saved. 

25 [0031] Optionally, the scanned document image data may be pre-processed by image 
processing module 22 while performing statistical analysis 21 and tonemap 
determination 23. In general statistical data can be collected with little overhead 
during other image processing or enhancement operations which calculate 
intermediate data such as edge probability which can be used to estimate a probability 

30 distribution either conditional or not. Hence, one advantage of the subject invention is 
that time consuming computations which are needed by both the image processing 
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module 22 and the statistical analysis module 21 are performed only once and the 
results can be used for both background noise removal and image enhancement 
operations. This is in contrast to the prior art system shown in Fig. IB where these 
computations are performed twice, once before data storage for image enhancement 
and once after for background noise removal. Image pre-processing can include but is 
not limited to filtering so as to selectively sharpen and smooth the image. Once pre- 
processed, the image data is stored with the background noise removal tonemap 
function. 

[0§32] Fig. 2B shows the second phase of the first embodiment of the present 
invenhqn. In this phase the stored information is retrieved including the background 
noise remoNaT tonemap and the image data (that may or may not be pre-processed). 
The retrieved d^ais provided to the background noise removal module 25 which 
includes a pixel coiiWion module 25A. The pixel conversion module 25A converts 
the image data pixel valufe^using either the background noise removal tonemap or an 
idle tonemap dependent on use^election module 24. The idle tonemap corresponds 
to an identity function such that theNgixels values remain the same. The background 
noise removal tonemap function maps fWl values identified as background to a 
selected pixel value effectively removing anWiwanted background noise. In the case 
in which the background noise removal LUT is stored by data storage module 24, then 
pixel conversion 25A is performed by indexing intofte noise removal LUT with each 
of the image data pixel values to obtain new pixel values* 

[0033] It should be noted that the image stored by data storage module 24 includes its 
original background pixel data. Consequently, during the second phase, the original 
background data has not been lost and can be provided to the user dependent on user 
preference. 

[0034] Splitting the background noise removal operation into these two phases, 
facilitates reduction of repeated statistical operations that are characteristic of prior art 
techniques. Hence, the majority of the background-removal computation occurs within 
the first phase. In the second phase, performed after loading the image from storage, 
background-removal is applied only by user selection. The computation involved in 
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the second phase is minimal. Thus, the user can efficiently toggle the background 
noise removal function on and off. For example, it is possible to view and print the 
image with its original background, and view and print the same image with its 
background noise removed. 

5 [0035] In addition, according to the present invention the background-removal 

operation is reversible. The image is stored without applying background-removal, but 
with additional tone-mapping data, which enables an efficient rendering of the image 
either with background removal or without it. Thus, the user can have it both ways for 
every single page. In addition, compared to previous "single pass" solutions the 
10 overall computational efficiency is substantially improved, and artifacts rooted in non- 
uniform background-tone estimation, are avoided. 



Q [0036] Fig. 3 illustrates a method corresponding to the technique shown in Figs. 2A 

Jj and 2B. According to this method, initially, scanned document image data is analyzed 

*1 to obtain statistical data 30. Based on the statistical data, a background noise removal 

j>* 15 tonemap function is derived 3 1 . According to Fig. 2A, the image data is analyzed 30 
^ and the tonemap is derived 31 in the first phase. The image data and the tonemap are 

flj then stored 32. Optionally, the image data may be pre-processed (not shown) prior to 

storage. The user is then allowed to select 33 between a first case in which 
background noise is removed by using the tonemap to convert the image pixels 34A 
20 and a second case in which background noise removal is bypassed 34B. 

[0037] Figs. 4A and 4B show functional flowcharts of first and second phases, 
respectively, of a second embodiment of the present invention. Referring to the first 
phase shown in Fig. 4 A, initially, statistical analysis 41 is performed on the scanned 
document image data. As with the embodiment shown in Fig. 2A, the image data may 
25 be simultaneously pre-processed by image processing module 42 where its 

intermediate computation results can be used by the statistical analysis module 41 to 
perform statistical estimations. In contrast with the embodiment shown in Fig. 2A, 
the statistical data obtained from the analysis 41 is stored by data storage module 43 
instead of being used to derive a background noise removal tonemap. Both the image 
30 data (pre-processed or unprocessed) are stored by the data storage module 43. In one 
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embodiment, the statistical data stored by module 43 can be but is not limited to at 
least one luminance histograms of the image data or an estimated global background 
tone value. 

[0038] In the second phase shown in Fig. 4B, the stored statistical data and image data 
5 are retrieved and provided to the background noise removal module 45. The 

background noise removal module 45 include background noise removal tonemap 
determination 45 A and pixel conversion module 45B. The user selection module 44 
allows the user to determine whether background noise is to be removed or not. If the 
user selects background noise removal, module 45A determines a background noise 
10 removal tonemap function from the statistical data. In a preferred embodiment (not 
shown), a look-up table corresponding to the background noise removal tonemap 
Q function is generated and saved. The pixel conversion module 45B then uses either 

J the stored background noise removal tonemap or LUT to remove the background 

Sj noise from the scanned document image data. Alternatively, the user can select not to 

y] 

15 remove background noise. If so, the idle tonemap is used by pixel conversion module 
45B such that the rendered and printed image retains all background information. 

[0039] Fig. 5 illustrates a method corresponding to the technique shown in Figs. 4A 
and 4B. According to this method, initially, scanned document image data is analyzed 
to obtain statistical data 55. Then, the image data and the tonemap are stored 56. 
20 Hence, in the first phase of this method, only statistical data is obtained and stored 
with the image data. As described above, the image data may be pre-processed (not 
shown) prior to storage. The user is then allowed to select 47 between a first case in 
which background noise is removed 58A or a second case in which background noise 
removal is bypassed 58B. In the first case, the background noise is removed by first 
25 deriving a background noise removal function from the retrieved statistical data and 
then converting the pixel values using the derived tonemap function. 

[0040] In one embodiment of the invention as shown in Figs. 2-5, the user can view 
the tonemapped image data in a display area (e.g., using a browser), prior to printing 
to determine which image (i.e., background noise removed or no background noise 
30 removed) is preferred. To enhance this option, once the image has been tonemapped 
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to remove background noise it is stored in cache. This allows the user to switch 
between viewing the original image (stored in cache) and the background noise 
removed image (also stored in cache) without requiring any additional processing. As 
a result, the user is provided with both options immediately once the initial 
5 tonemapping is performed. 

[0041] It should be understood that according to the systems and methods shown in 
Figs. 2-5, background noise removal may be performed prior, during, or after 
rendering of the image data. In one preferred embodiment, background noise removal 
operations performed by modules 25 and 35 (Figs. 2B and 4B) are embedded into the 
10 image data rendering process. For example, in one preferred embodiment, when a 
U user retrieves a document image from file storage for rendering, a background 

removal data object is created which includes the background removal LUT (LUT B r) 
and stored user options. Initially, if the user has not previously retrieved the document 
image from file storage, the user options are set to a default setting in which no 
Rj 15 background removal is performed. In this case, the document image is automatically 
2 processed by background noise removal modules 25 or 35 using the idle tonemap (i.e., 

n " 

fij the identity tonemap) and the processed image is stored in cache for easy access. At 

W this point the user can select on a page-by-page basis whether to apply background 

P noise removal using the browser. In one embodiment, the browser provides three 

^ 20 options to the user including: 1) print according to currently stored user options; 2) 
remove background noise from all pages in the file; and 3) no noise removal. Once 
selected, the user options for each page is stored in the background removal data 
object and background noise removal is performed as described in conjunction with 
the embodiments shown in Figs. 2B and 4B and is also stored in cache fo^easy 
25 accessibility. Once a page has been processed to remove noise both the original image 
and the noise removed image are stored in cache allowing the user to "toggle" 
between the images without further processing to obtain the best results. 

[0042] In another preferred embodiment, the prioritized document format is used to 
store the image data and statistical data, background noise removal tonemap function 
30 or background noise removal tonemap LUT. In particular, the format is implemented 
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for storage of multiple data types such an image data, statistical data, LUTs, functions, 
etc. In one embodiment, the document format comprises various streams 
representing, among other things, the image data, the tonemap function, etc. 
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[00H3] In one embodiment, the modules and processes shown in Figs. 2-5 can be 
performed in a similar or modified manner as described in U.S. Application No.: 
09/704358Wled "System and Method for Enhancing Scanned Document Images 
for Color PrintW filed November 1, 2000 and assigned to the assignee of the subject 
application and inc\porated herein by reference. Accordingly, the scanned document 
image data can be initi^color converted. For instance, the scanned document 
image data may be color c\erted from a RGB color space to a YC b C r color space. 
In this embodiment, the YC b QcW space is better adapted for obtaining statistical 
g information about the background ohfce scanned document image data. More 

| particularly, statistical analysis can be limited to the luminance channel for simplified 

S| computations and adequate approximation resb^ Other color spaces having a 

jjjj 15 separate luminance component may also be utilizec 

i 

' [0044] On one preferred embodiment, the desired statistical result of the statistical 

Si analysis module (21,41) is an estimated global background tone value Y b . It should be 

S understood that the global background tone value can be estimated from all three of 

|j the YCbC ' color channels. However, satisfactory results can be obtained using the 

20 luminance channel (Y) to estimate the luminance portion of the background tone value 
and using that to process image data for background noise removal. Consequently, the 
remainder of the statistical analysis will only be described in terms of the luminance 
channel. 
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[0045] The estimated tone background value Y b , is determined by initially performing 
edge detection of the luminance value of all pixels in the image data. Edge detection 
operates to detect edges of salient features such as text contained in the image data. 
Initially, two metrics Dl and D2 (Eqs. 1 and 2) are determined. Metric Dl 
corresponds to first-order derivative and metric D2 corresponds to a second-order 
derivative. Namely, Dl is a "morphological gradient", and D2 is a "morphological 
laplacian". The metric Dl is used to determine whether a pixel of interest is a part of 
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a salient edge such as an edge between text and background. The metric D2 is used to 
determine whether an edge pixel is on the dark side or the bright side of the edge. The 
two metrics are calculated by determining for each pixel the minimum luminance (m) 
and maximum luminance (M) values in a 3x3 punctured neighborhood of the pixel 
5 (i.e. the eight nearest neighbors, excluding the pixel itself). In this case Dl is given by 
Eq.l: 

(1) Dl =M-m; 
and D2 is given by Eq. 2: 

(2) D2 = M+m-2*Y\ 

Lj 10 where Y' is either the original luminance value Y, or preferably the luminance value 

Q of a smoothed luminance image in the same location. 

W 
sj 

y-j [0046] To determine if a pixel is part of an edge its corresponding first metric, D 1 , is 

|1 compared to a threshold value T e . That is, a pixel is part of an edge if Dl> T e . 

Furthermore, to determine if the pixel is part of the dark side or the bright side its 
corresponding second metric, D2, is compared to zero. Specifically, if D2>0 the pixel 
jjj is on th e dark side; else, the pixel is in the bright side. 

O 

fy [0047] The threshold T e is made adaptive by setting a minimum (initial) value 

(Te_min) and letting the threshold float with the maximum value of the metric Dl 
(i.e., Dl_max) as more image data is processed. Dl_max is then updated after 
20 processing each line according to the following condition: if T e <k*Dl_max then 
T e =k*Dl_max. This condition ensures that the threshold T e can only go up. The 
minimum value of T e (i.e. , T^min) and the parameter k should be determined by 
experiments. In the preferred embodiment they are set to Te_min=0, k=0.25. 

[0048] The estimated background luminance is determined using two histograms 
25 computed by statistical analysis of the image data. One of the histograms corresponds 
to all pixel luminance values and is denoted H a , and the other for the histograms 
corresponds to luminance values of pixels on the bright side of edges (as determined 
above) and is denoted H e . Figs. 6A and 6B show examples of an edge luminance 
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histogram H t and the histogram H a corresponding to all pixel values. A thresholding 
algorithm, referred to as the Kittler-Illingworth algorithm is used to analyze the H e 
histogram and based on this analysis a background threshold value is estimated. The 
Kittler-Illingworth algorithm as applied to background tone estimation is described in 
5 U.S. Application No.: 09/704358 entitled "System and Method for Enhancing 

Scanned Document Images for Color Printing" filed November 1, 2000 and assigned 
to the assignee of the subject application reference above. In particular, the Kittler- 
Illingworth algorithm finds within each histogram a valley that corresponds to a 
minimum error threshold value. The minimum error threshold value corresponds to a 
10 threshold value that minimizes a global error measure of replacing all gray level 

values with binary values, (i.e., replacing pixel values higher than the threshold with a 
M pixel value of 255 and replacing pixel values lower than the threshold with a pixel 

g value of 0). This optimal minimum error threshold value occurs within the main 

W valley of each histogram as approximately shown by indicators 60 and 61 in Figs. 6A 

Lfi 15 and6B. 

m 

b [0049] Using the minimum error threshold value 60 for the edge histogram H t , a 

Q threshold value W e corresponding to the peak 62 between the valley (i.e., minimum 

ft| error threshold value) and 255 is determined for the edge histogram H e . From the 

| peak luminance value for the edge histogram H e , a I high value and a I low value is 

n| 20 determined corresponding to the points where histogram H e falls to half of the peak 

value (63 and 64, Fig. 6A). Next, a spread index is determined from the I high and I low 

values for histogram H e . The spread index is defined as: 

(3) S e ={hi S h-Iio W )/256 

The spread index is used as a confidence measure of the estimate. 



25 



[0050] As described above, a minimum error threshold value is obtained for histogram 

using the Kittler-Illingworth algorithm. This minimum error threshold value (61, 
Fig. 6B) is designated as threshold value W a . The threshold values, W e and W a , are 
shown in Figs. 6 A and 6B by indicators 62 and 61. 
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[0051] A minimum acceptable estimated value MIN_ WHITE is established for the 
background threshold information W e and W a by embedding MEN_WHrTE in a non- 
linear function/. That is W e '=f(W e ) and W a '=f(W a ). For images with mostly black 
figures and little unprinted regions, the background threshold information W e and W a 
are usually far below the minimum acceptable value. In these cases, it may be 
desirable to reduce the effect of background removal that will be performed on the 
image, or even to not perform background removal. This can be effectively realized 
by functions such as the following limiting function. 



(4) /(v) = ■ 



v, if v > MIN _ WHITE 

(255.0 - MIN _ WHITE) ■ exp(-(r • MIN _ WHITE 1 255 • vl 255) 2 ) 
+ MIN_WHITE, otherwise 



gj 10 for v-(W e , W a ). The limiting function f(v) with MIN_WHITE=220 and r=3.5 is shown 
in Fig. 7. Other functions characterized by three distinct regions may also be used to 
achieve a similar effect. 



[0052] Next, the estimated background luminance Y b is obtained according to Eqs. 5 
jjj and 6 by a weighted averaging of the two estimations, W e ' and W a \ obtained from the 

nj 15 above function (Eq. 4). 

m ~ 

Si < 5 > Y »=Pe ■ W e ' + (l-p e ) ■ Wa 

(6) where p e = (1- 5 e ) • tanh(a • N</N a ) 

In a preferred embodiment a = 4000. The statistical confidence, p e , increases with 
diminishing spread index of the edge histogram (S e ), and with increasing population of 
light edge pixels (N e ) relative to the general pixel count N a (Eq. 6). In the preferred 
embodiment, Eq. 6 is used for edge related weighting. Note that the weight can obtain 
only values between 0 and 1. 

[0053] Once an estimated global background tone value is obtained by either of 
statistical modules 21 or 41 (Figs. 2A and 4A, respectively) it is in one embodiment 
25 provided to the background noise removal tonemap derivation module 23 in 
accordance with Figs. 2A and 3 or it is stored by the data storage module 43 in 



20 
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accordance with Figs. 4A and 5. A background noise removal tonemap is constructed 
according to the following considerations: 

- For an effective bleaching of the background it is preferred that all pixels with 
luminance lighter than Y b map to pure white: Q(Y>Y b ) =255. 

- In order to avoid unnecessary lightening of pixels which are much darker than 
the background luminance, the mapping should to converge to identity when 
Yb-Y»255-Yb: Q(Y : AiL.»i )s= y 

- The transition between the mapping of very light and very dark luminance should 
be graceful to avoid artifacts. 

The tone-mapping function used in one preferred embodiment which fulfills the above 
requirements is: 



Fig. 8 shows the above function in which Y b =220. 

[0054] It should be understood that the modules shown in Figs. 2A, 2B, 4A and 4B 
can be implemented as any combination of software, hardware, or firmware. 

[0055] In the preceding description, numerous specific details are set forth in order to 
provide a thorough understanding of the present invention. It will be apparent, 
however, to one skilled in the art that these specific details need not be employed to 
practice the present invention. In addition, it is to be understood that the particular 
embodiments shown and described by way of illustration is in no way intended to be 
considered limiting. Reference to the details of these embodiments is not intended to 
limit the scope of the claims. 
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